BI、Data Warehouse、Big Data Analytics and NoSQL

《Database Systems Design, Implementation, & Management》12 ed 是16年出版的,比较新,而且对一些新的内容作了深入细致的分析和讨论,第13章是BI和数据仓库(Business Intelligence and Data Warehouses),第14章是大数据分析与 NoSQL(Big Data Analytics and NoSQL),这些内容我在《数据库系统概念》和《数据库系统实现》上都没有看到(也可能是看的中文版少了一些新的内容),所以看一下这两章作为补充。BI和数据仓库不是什么新的概念了(对于技术的发展史来说),而大数据和NoSQL、NewSQL这几年才火了起来。虽然网上关于这些方面的中文资料也挺多,但是毕竟这是国外大牛写的系统性介绍,有利于对整体的概念有全面清晰的了解,所以还是要看一下这个。(文末有下载链接)

下面是这两章的简介:

Business Intelligence and Data Warehouses

In this chapter, you will learn:

  • How business intelligence provides a comprehensive business decision support framework
  • About business intelligence architecture, its evolution, and reporting styles
  • About the relationship and differences between operational data and decision support data
  • What a data warehouse is and how to prepare data for one
  • What star schemas are and how they are constructed
  • About data analytics
  • About online analytical processing (OLAP)
  • How SQL extensions are used to support OLAP-type data manipulations

(部分)本章介绍的相关概念和技术:

  • DSS(decision support system)
  • ETL(extraction, transformation, and loading)
  • MDM(master data management)
  • OLAP(online analytical processing )
  • ROLAP(relational online analytical processing)
  • MOLAP(multidimensional online analytical processing)
  • drill down, roll up, slice and dice
  • snowflake schema, star schema
  • VLDB(very large database)

Big Data Analytics and NoSQL

In this chapter, you will learn:

  • What Big Data is and why it is important in modern business
  • The primary characteristics of Big Data and how these go beyond the traditional “3 Vs”
  • How the core components of the Hadoop framework, HDFS and MapReduce, operate
  • What the major components of the Hadoop ecosystem are
  • The four major approaches of the NoSQL data model and how they differ from the relational model
  • About data analytics, including data mining and predictive analytics

(部分)本章介绍的相关概念和技术:

  • row-centric storage
  • column-centric storage
  • column family database
  • document database
  • graph database
  • key-value (KV) database
  • stream processing
  • structured data
  • unstructured data
  • NoSQL
  • NewSQL

我把自己对12章和13章中感兴趣的部分翻译了当做笔记,以利于日后回顾相关知识

电子书下载链接:http://pan.baidu.com/s/1i4UaFSp 密码:6ih7